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REMARKS 

Applicants and the undersigned are most grateful for the time and effort accorded 
the instant application by the Examiner. The Office is respectfully requested to 
reconsider the rejections presented in the outstanding Office Action in light of the 
following remarks. 

Applicants note the Office Action does not acknowledge the claim for foreign 
priority made at the time of filing. Applicants also Submitted a certified copy of the 
priority document at filing. Acknowledgement is respectfully requested in the next 
communication from the Office. 

Claims 19-49 were pending in the instant application at the time of the 
outstanding Office Action. Of these claims, Claims 19, 25, 3 1, 39 and 47-49 are 
independent claims; the remaining claims are dependent claims. The independent claims 
have been rewritten. Applicants intend no change in the scope of the claims by the 
changes made by these amendments. It should also be noted these amendments are not in 
acquiescence of the Office's position on allowability of the claims, but merely to expedite 
prosecution. 

Claims 25-30, 39-41, 43-47 and 49 stand rejected under 35 USC § 102(b) as being 
anticipated by Kimber et al. Reconsideration and withdrawal of this rejection is 
respectfully requested. 
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As best understood, Kimber et al. appears to be directed to a method of clustering 
speaker data from a plurality of unknown speakers in conversational data. (Abstract; Col. 

1, lines 26-28) While the identify of the speakers is not known in Kimber et al., speech 
segments from each speaker appear to be clustered together. An index for the complete 
recording is then created by collecting the various segments that are similarly marked by 
an individual. (Col. 12, lines 17-19) 

Creating an index for a complete conversation, or for that matter, unknown 
speakers, stands in stark contrast to the present invention. As discussed in the 
specification, there are various issues with training speech recognition systems to deal 
with multiple speakers. Prior efforts to automize the indexing of audio material, e.g., 
using prior are Speech recognition technology, thus failed due to the large variability of 
speech styles and diaiects of the human individuals engaged in those interactions. (Page 

2, lines 8-10) Thus, the idea underlying the present invention is to locate segments in a 
continuous audio stream where a change-over to at least one predefined speaker occurs 
and to apply speech recognition or voice control techniques only to those audio segments 
belonging to the predefined speakers. (Page 3, lines 13-16) In particular, the present 
invention proposes to apply known speaker recognition techniques to conversations 
between a well-known speaker and a multitude of unknown speakers and thereby allows 
to transcribe only the utterances of the well-known speaker as an index and summary of 
the dialogues. (Page 4, lines 7-10) 

Claim 1 has been rewritten to recite, inter alia, identifying a known speaker from 
among the plurality of speakers and transcribing at least part of the continuous audio 
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stream if the known speaker is recognized, (emphasis added) Similar language also 
appears in the other Independent Claims. 

. It is respectfully submitted thai Kimber et al. clearly falls short of present 
invention (as defined by the independent claims) in that, inter alia, it does not disclose 
identifying a known speaker from among the plurality of speakers and transcribing at 
least part of the continuous audio stream if the known speaker is recognized Accordingly, 
Applicants respectfully submit that the applied art does not anticipate the present 
invention because, at the very least, "(anticipation requires the disclosure in a single 
prior art reference of each element of the claim under construction." W.L. Gore & 
Associates, Inc. v. Garlock, 111 F.2d 1540, 1554 (Fed. Cir. 1983); see also In re 
Marshall, 198 U.S.P.Q. 344, 346 (C.C.P.A. 1978). 

Claims 19-24, 31-38, 42 and 48 stand rejected under 35 USC § 103(a) as obvious 
over Kimber et al. in view of Glickman et al. Specifically the Office asserted that "[i]t 
would have been obvious ... to modify Kimber et al. by incorporating the teaching of 
Glickman et al. in order to provide automatic closed-caption using speaker-dependent 
models to enhance speech recognition accuracy." Reconsideration and withdrawal of the 
present rejections are hereby respectfully requested. 

A 35 USC 103(a) rejection requires that the combined cited references provide 
both the motivation to combine the references and an expectation of success. Not only is 
there no motivation to combine the references, no expectation of success, but actually 



- 14- 



PAGE 18/20 ' RCVD AT 3/16/2005 11:23:43 PM [Eastern Standard Time] * SVR:USPTOIFXRN/0 ' DNIS:8729306 * CSID:412 741 9292 1 DURATION rjnm-ss):04-30 



03-16-' 05 23:26 FROM- 412-741-9292 T-150 P019/020 F-406 

Atty. Docket No. DE920000055US1 

(590.080) 

combining the references would not produce the claimed invention. Thus, the claimed 
invention is patentable over the combined references and the state of the art. 

Glickman et al. does not overcome the deficiencies of Kimber et al. set forth 
above. In that regard, Glickman et al. "develop[s] separate acoustic-phonetic models ... 
for [each of] the multiple speakers." (Col. 5, lines 51-52) Glickman et al. continues that 
"[a]fter the models are 'trained', speaker recognition can automatically be performed" 
and *'[t]his technique can also be used to perform 'automatic' closed captioning." (Col. 5, 
lines 52-55) Glickman et al. thus teaches transcription of the speech of all speakers 
involved in a particular conversation. 

There is an inherent tension in Kimber et al. and Glickman et al. given that in 
Kimber et al. all of the speakers are unknown and in Glickman et al. all of the speakers 
are known, which teaches away from combining these two references. At best, however, 
combining Kimber et al. and Glickman et al. would result in continuously training models 
for each speaker so that the speech of all participants in a conversation would be 
transcribed and displayed as closed captioning. Even if there were a motivation for the 
combination, this combination does not teach or suggest the claimed invention. 

In view of the foregoing, it is respectfully submitted that Independent Claims 19, 
25, 3 1, 39 and 47-49 fully distinguish over the applied art and are thus allowable. By 
virtue of dependence from Claims 19, 25, 31 and 39, it is thus also submitted that Claims 
20-24, 26-30, 32-38 and 40-46 are also allowable at this juncture. 
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In summary, it is respectfully submitted that the instant application, including 
Claims 19-49, is presently in condition for allowance. Notice to the effect is hereby 
earnestly solicited. If there are any further issues in this application, the Examiner is 
invited to contact the undersigned at the telephone number listed below. 



Respectfully subrnitted, 





Stannsjtp^JFerence III 
Registration No. 33,879 

Customer No, 35195 

FERENCE & ASSOCIATES 

400 Broad Street 

Pittsburgh, Pennsylvania 15143 

(412) 741-8400 

(412) 741-9292 -Facsimile 

Attorneys for Applicants 
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